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Abstract — A memoryless interference network model is intro- 
duced that is based on non-linear phenomena observed when 
transmitting information over optical fiber using wavelength- 
division multiplexing. The main characteristic of the model is 
that amplitude variations on one carrier wave are converted 
to phase variations on another carrier wave, i.e., the carriers 
interfere with each other through amplitude-to-phase conversion. 
For the case of two carriers, a new technique called interference 
focusing is proposed where each carrier achieves the capacity 
pre-log 1, thereby doubling the pre-log of 1/2 achieved by 
using conventional methods. The technique requires neither 
channel time variations nor global channel state information. 
Generalizations to more than two carriers are outlined. 

L Introduction 

The additive white Gaussian noise (AWGN) channel, suit- 
ably modified, is a good model for many problems encountered 
in practice. For example, a parallel AWGN channel is accu- 
rate for communication over copper cables with orthogonal 
frequency-division multiplexing (OFDM). An AWGN channel 
with multiplicative noise models multi-path fading for wireless 
communication. The capacities ||T| of these channels have been 
studied in great detail. In contrast, the capacities of fiber-optic 
channels have attracted less interest in the information theory 
community (see, e.g., the tutorial paper [2]). Perhaps this is 
because it was not until recently that it became necessary to 
communicate efficiently over fiber; optical fiber has long been 
viewed as having "bandwidth to burn". However, the relentless 
increase in traffic demand and advances in optical technology 
have made determining fiber capacity of great interest. 

II. Fiber Channel Models 

The fiber channel suffers impairments such as propagation 
loss, dispersion, and Kerr non-linearity. Optical amplifiers 
such as Erbium-doped fiber amplifiers (EDFAs) compensate 
the attenuation in fiber links without electronic regeneration, 
and as a result amplified spontaneous emission (ASE) noise 
becomes a significant problem. Dispersion arises because the 
propagating medium absorbs energy through the oscillations 
of bound electrons, causing a frequency dependence of the 
material refractive index fSl, p. 7]. The Kerr effect is caused 
by anharmonic motion of bound electrons in the presence of an 
intense electromagnetic field, causing an intensity dependence 
of the material refractive index |3 , p. 17, 165]. 



Let A{z,t) be a complex number representing the slowly 
varying component (or envelope) of a single mode, linearly 
polarized, electric field at position z and time t. Suppose we 
use a retarded-time reference frame with T = t — f3iz where 
Pi is the reciprocal of the group velocity. Suppose further 
that the ASE noise is negligible. The evolution of A{z,T) 
is then governed by the generalized non-linear Schrodinger 
(NLS) equation ^ p. 44, 50]: 
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where i — ^/—\, a is the attenuation constant, /32 is the group 
velocity dispersion (GVD) coefficient, 7 = n2a;o/(cAeff), n2 
is the non-linear refractive index, luq is the carrier frequency, 
c is the speed of light, and A^a is the effective cross-section 
area of the fiber One usually normalizes A{z, T) using 
which effectively lets one set a = (see ||3] p. 50, 64]). 

We are interested in studying the impact of non-linearities, 
so we consider the simplified model where /32 = 0, i.e., there is 
no dispersion or completely-compensated dispersion. Eq. ([TJ 
with a = and ~ has the exact solution |3j p. 98] 



A{L,T) = A{0,T)e 
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where L is the fiber length. In other words, Kerr non-linearity 
leaves the pulse shape unchanged but causes an intensity- 
dependent phase shift. The phase shift phenomenon is called 
self-phase modulation (SPM). 

Suppose now that two optical fields at different carrier 
frequencies uji and uj2 are launched at the same location and 
propagate simultaneously inside the fiber The fields interact 
with each other through the Kerr effect [3, Ch. 7]. Specifically, 
neglecting fiber losses by setting a = 0, the propagation is 
governed by the coupled NLS equations f3^, p. 264, 274]: 
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where Aj{z^T) is the time-retarded, slowly varying compo- 
nent of field j, j = 1,2, the (32 j are GVD coefficients, the 7^ 
are nonlinear parameters, and d = /3i2 — Pii where the j3ij 



are reciprocals of group velocities. We will assume that wi 
and L02 are sufficiently close so that we can set d = 0. We 
further simplify and choose /32i = P22 = 0. The coupled NLS 
equations ©-(Hli have the exact solutions |[3] p. 275] 

Ai(L,r) = Ai(0,T)e"^i^(l^^(°'^)l'+2|^^(0'^)l') (5) 
A2{L, T) = A2(0, T)e'i-'^^\^2(^''^^?+2\M(o,T)\-) 

where z = is the point at which both fields are launched. 
Kerr non-linearity again leaves the pulse shapes unchanged but 
causes interference through intensity-dependent phase shifts. 
The interference phenomenon is called cross-phase modulation 
(XPM). XPM is an important impairment in optical networks 
using wavelength-division multiplexing (WDM), see [2J. 

Equations (O-® generalize naturally to launching and 
receiving fields at different locations, and to using K fields 
with K > 2. However, it seems prudent to emphasize that 
ignoring dispersion, or memory, is considered unrealistic for 
optical networks. On the other hand, our results do show that a 
new method called interference focusing is needed to approach 
capacity without dispersion. It seems natural to expect that 
this method will be useful with dispersion also, and this is the 
subject of ongoing work. The reason we study a memoryless 
model is to take a first step in gaining understanding. 

To strengthen the link to realistic channel models, we point 
out that a 2-parameter phenomenological model that captures 
the effects of XPM in optically-routed WDM networks was 
recently proposed in |4|. The model is memoryless in that 
each received symbol Y is related to the transmitted signal 
only through the current transmitted symbol X as follows: 

Y = Xe**^" + Z (7) 

where $pAr is a Gaussian random variable with variance 
CiVar(|Xp) and ci is a parameter that accounts for system 
specifications, e.g., the number of WDM channels. Z is 
AWGN with variance cr| = iV + C2V^{\X\Y where N is 
the noise variance in the absence of non-linearities and C2 is 
another system-specific parameter The authors of |4| use ^ 
to accurately predict the channel capacities obtained from full- 
field numerical simulations reported in [5J, 16J. We make two 
observations. First, the WDM channels in jSl, 161 are made 
approximately memoryless by using reverse propagation to 
compensate dispersion. Second, the ^pn in (|7]i is approxi- 
mately Gaussian if ^pn is a sum of many weighted terms of 
the form 1^^.(0, T)|2, k = l,2,...,K, similar to ©-Q. 

III. Interference Network Model 

Equations Q-© and their generalizations to K frequen- 
cies motivate the following memoryless interference network 
model based on sampling the fields Ak{z, T), k = 1,2, . . . , K, 
at z = and z — L. Transmitter k sends a string of 
symbols — {Xks, Xk^2, • ■ ■ , Xk.n) while receiver k sees 
= iYk.i,Yk^2,- ■ ■ ,Yk,n)- We model the input-output 
relationship at each time instant j as 

Yk,j = Xk,j exp 1^ hkAXLj?^ + Zk,j (8) 



for k = 1,2,...,K where Zk^j is circularly-symmetric 
complex Gaussian noise with variance N . All noise random 
variables at different receivers and different times are taken 
to be independent. The terms exp(ift,fefe|Xfejp) model SPM 
and the terms exp(z/ife£|X£.j p), k ^ I, model XPM. We 
regard the hki as channel coefficients that are time invariant. 
These coefficients are known at the transmitters as well as 
the receivers, although we shall later see that we need local 
channel state information only. We use the power constraints 

1 " 

- VE[|Xfc,,f] <PW, fc = l,2,...,i^. (9) 

Definition 1: The p re-log Vk achieved by user k, whose 
information rate is Rk {P^^\ ■ ■ ■ , P^^\N), is 

rk = lim ' ' , ' ^ (10) 

P(i)/Ar,...,P(A-)/W^oo log(PW/iV) 

for k = 1, . . . ,K. Thus, a K-user transmission scheme may 
be studied by computing the pre-log if -tuple (ri, r2, . . . , r/f ). 

One achieves the pre-log A'-tuple (1/2, 1/2, . . . , 1/2) if all 
users use only amplitude modulation or only phase modula- 
tion. The main point of our work is to show that one can, in 
fact, achieve the ultimate pre-log A'-tuple (1,1,. ..,1). 

We again emphasize that we have ignored dispersion. Fur- 
thermore, the validity of ^ depends on the amplification, 
the network topology, the type of fiber, and so forth. For 
instance, when performing distributed amplification with stim- 
ulated Raman scattering, then additional phase noise should be 
included at high transmit powers. Also, our analysis assumes 
p(fe)/iV 00 for all fc, and it assumes perfect channel 
knowledge. Determining the capacity for finite P^'''> /N and 
partial channel knowledge are the subjects of ongoing work. 

TV. Two-User Interference Channel 

Consider the 2-user interference channel for which (O 
without the time indexes becomes 

Fi =Xiexp(z/in|Xi|2+i/ii2|^2p) +^1 (H) 

Y2 = X2exp {lh2l\Xi\^ + lh22\X2\^) + Z2. (12) 

We propose an interference focusing scheme in which the 
transmitters focus their phase interference on one point by 
constraining their transmitted signals to satisfy 

/i2i|Xip = 2mn, m = 1, 2, 3, . . . (13) 
ftialXal^ = 2m7r, m = 1, 2, 3, . . . (14) 

In other words, the transmitters use multi-ring modulation with 
specified spacings between the ringsQ We thereby remove 
XPM interference and (fTTli-(fT2]l reduce to 

yfc =Xfce*''"l^'=l' fc = l,2. (15) 

This channel is effectively an AWGN channel since hkk is 
known by receiver k and the SPM phase shift is determined 
by the desired signal Xk- 

'Multi-ring modulation was used in (2), (5), (6) for symmetry and compu- 
tational reasons only. We here find that it is useful for improving rate. 



It remains to show that the pre-log pair (ri,r2) — (1, 1) is 
achieved under the constraints (fr3Tl-(fT4]i. We show this in two 
steps: we first determine the information rate for one ring and 
then extend the analysis to many rings. 

A. One Ring 

Consider the AWGN channel Y ^X + Z with X = \/Pe'* 
where <i> is uniformly distributed on the interval [0, 27r). The 
achievable rate R is given by 

R = I{X; Y) = h{Y) - h{Y\X) 

= E[- logpy(r)] - log(^eiV) (16) 

The probability density of Y can be shown to be |2, p. 688] 



(17) 



where /o( ) is the modified Bessel function of the first kind 
of order zero and i/a = \y\- Therefore, we have 



hiY)=E -log|^-Le-(^I+^)/^/o(^ 
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Next, we derive an upper bound on Ia{z) that we will use 
in the process of lower bounding h{Y). 
Lemma 2: We have 



/o(.)<^^, .>0. 



(19) 



Proof: We have cos a; < 1 — ix^/n^ for < a; < 7r/2 by 
using the infinite product form for cosine 
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We thus have 
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^ /'^dx. Finally, we observe that 



Q{z) > and 0FeV(4Vi) > 1/2 for z > 0. 
Using Lemma ID in dTsl l. we have 
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The last expectation in ( |23] i is 
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Setting z = yA \/2/N and v'^ = 2P/N, expression (l24]l is 
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where r(a, x) is the upper incomplete Gamma function ||2l p. 
260] and where the second step follows by jSl. Inserting ( |25l ) 
into (|23] |. and then (|23] ) into ( fTSI l, gives 
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where we have made use of r(0,a;) > for a; > 00 The 
desired pre-log therefore satisfies 

RiP/N) 



lim 
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log(P/7V) 

B. Multiple Rings 

Consider multiple rings with X = 
where J is the number of rings. The power levels Pj allowed 
under interference focusing take the form mpQ where m is a 
positive integer and po is the minimum (non-zero) power level 
that depends on the channel coefficients. For example, for the 
2-user interference channel po — 27r/ft,2i for transmitter 1 and 
Po = 27r/ft,i2 for transmitter 2. The power levels must further 
satisfy E[|X|2] < P where P is the power constraint of the 
user being considered (our pre-log analysis is based on a point- 
to-point AWGN channel because interference is removed by 
interference focusing). The achievable rate R is given by 

R = IiX;Y)^I{XA,<S>;Y) 

^ I{Xa;Y) + I{^;Y\Xa) (29) 

where X — X^e'*. The term I{Xa',Y) can be viewed as 
the amplitude contribution while the term /($; is the 

phase contribution. 

Suppose that, for simplicity, we choose the rings to be 
spaced uniformly in amplitude as 



Pj = af Po 



(30) 



where a is a positive integer We further use a uniform 
frequency of occupation of rings with PxA{\/Pj) ~ ^/J, 
j = 1, 2, . . . , J. The power constraint is therefore 

jY.^'J^P^^P- ^31) 



Note that lim^ 



. r(0,a;) = 0. 



1} Phase Contribution: Using jZTl ). we have 
1 

Y\Xa) = Y.1 ^1^^ = 
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We show in the Appendix that by choosing a in (|30] l to scale 
as A^log(P/A^), and choosing J to satisfy ( [3T] i. then we have 



lim 



1 1 

2 



log(P/A^) - 2 



(34) 



The pre-log of the phase contribution is therefore at least 1/2. 
2} Amplitude Contribution: We have 



I{Xa;Y) ^ H{Xa) - H{Xa\Y) 



(35) 



where H{Xa) — log (J). We show in the Appendix that J 
scales as ^{P / N) /\og{P / N) if a scales as A^log(P/A^). 
We bound H{Xa\Y) using Fano's inequality as 



H{Xa\Y) < H{Xa\Xa) 

< H{P,) + P, log( J - 1) 



(36) 



where Xa is any estimate of Xa given Y, P^ = Pt[Xa 
Xa] and H{Pe) is the binary entropy function with a gen- 
eral logarithm base. Suppose we use the minimum distance 
estimator 



Xa = arg min \Ya 



xa\ 



where Ya = \Y\ and Xa = {^/P] : J = 1, ■ 
in the Appendix that 
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where Aj — {^/Pj — \/ Pj-i) /\fN ■ For the power levels dSOl), 
we have A, 
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if a scales as 



iVlog(F/iV) (recall that J scales as y/{P/N)/ \og{P/N)). 



We thus have limp/^_ 
Consequently, we have 
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We conclude from ( l29b , (O, and (l40b that interference 
focusing achieves the largest-possible pre-log of 1 . Each user 
can therefore exploit all the phase and amplitude degrees of 
freedom simultaneously. 



V. iiT-UsER Interference Network 

We outline how to apply interference focusing to problems 
with K > 2. Define the interference phase vector 



^1. = *2 



(41) 



where "^k = J2f=i^ke\Xe\'^ and the instantaneous power 
vector 

n^[|Xip,...,|X^P]^. (42) 

The relation between the ^ and n in matrix form is 

HspU + HxpR (43) 

where Hsp is a diagonal matrix that accounts for SPM and 
Hx p is a zero-diagonal matrix that accounts for XPM. 

Example 3: Suppose the XPM matrix for a 3-user interfer- 
ence network is 
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1/2 3/5 
3/4 2/3 
5/6 1/5 
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Suppose that each transmitter knows the channel coefficients 
between itself and all the receiving nodes. The transmitters 
can thus use power levels of the form 

n = 27r • [lcm(4, 6)toi, lcm(2, 5)m2, lcm(5, 3)m3 ] 
= 27r- [12mi,10m2,15m3] (45) 

where lcm(a, h) is the least common multiple of a and 6, and 
mi, 1712, are positive integers. We thus have 



(46) 



which implies that the phase interference has been eliminated. 

Example |3] combined with an analysis similar to Section 
|IV] shows that interference focusing will give each user a pre- 
log of 1 even for K-nsei interference networks. However, the 
XPM coefficients h^i must be rationals. Modifying interfer- 
ence focusing for real-valued XPM coefficients is an inter- 
esting problem. It is clear from Example [3] that interference 
focusing does not require global channel state information. 

VI. Conclusion 

We introduced an interference network model based on a 
simplified optical fiber model. We assumed that there was no 
dispersion, or that dispersion was compensated. The non-linear 
nature of the fiber-optic medium causes the users to suffer 
from amplitude-dependent phase interference. We introduced 
a new technique called interference focusing that lets the users 
take full advantage of all the available amplitude and phase 
degrees of freedom. Several generalizations are interesting to 
study further, e.g., introduce group velocity, focus interference 
on multiple points, study low and intermediate signal-to-noise 
ratio, investigate partial channel knowledge, and so on. 
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Appendix 

Phase Modulation with Multiple Rings 

We derive the key relations to prove that uniform phase 
modulation contributes 1/2 to the pre-log for specially-chosen 
multi-ring modulations. For dsTl ) we compute 

J 



(J + 1)(2J+1) 
6 



so to satisfy the power constraint we choos^ 

j_ -3+ v/l + 48P/(apo) 



(48) 



We choose a to scale as A^log(P/A^) so J scales as 
^J{P/N)/\og{P/N). Next, consider the sum in (1341 ). The 
logarithm is an increasing function so we have 
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= J {ln{aJ^po/N) - 2) log(e) (50) 



We can therefore write 
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^log{aJ^Po/N) ^ 1 
log(P/iV) 2 
(51) 

where ( fSTT l follows because a scales as \og{P/N), scales 
as (P/N)/ \og{P/N), and po is independent of P and N. 

Minimum Distance Estimator 
We derive the bound (|38] l for the estimator (|37| i. Let P^j 

/P". We have P, = 



be the error probability when Xa 

E/=i jPe.j and 



Pr Ya > 
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Pr{YA> 



otherwise. 
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Conditioned on Xa = \/Pj, Ya is a Ricean random variable, 
and hence we compute |9, p. 50] 



Pr Fa > 



VP3 
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(53) 



^The solution for J should be positive and rounded down to the nearest 
integer but we ignore these issues for notational simplicity. 



where Q{a,b) is the Marcum Q-function ifTol . Consider the 
following bounds. 

. Upper bound for 6 > a fJO', UBIMG] 

"-"'^^ (54) 



Q{a, b) < exp 

. Lower bound for 6 < a HOl LB2aS] 

Q{a,b) 



> 1 - - 
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(47) The bound ( fSSl l implies 
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where Aj 
equality 



{^/^j — ■y/Pj_i)/\/A'. Similarly, we use in- 
to write 
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Collecting our results, we have 
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